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Background A novel approach is explored for improving causal inference in ob- 
servational studies by comparing cohorts from high-income with 
low- or middle-income countries (LMIC), where confounding struc- 
tures differ. This is applied to assessing causal effects of breastfeed- 
ing on child blood pressure (BP), body mass index (BMl) and 
intelligence quotient (IQ). 

Methods Standardized approaches for assessing the confounding structure of 
breastfeeding by socio-economic position were applied to the British 
Avon Longitudinal Study of Parents and Children (ALSPAC) 
(W~5000) and Brazilian Pelotas 1993 cohorts (W~1000). This 
was used to improve causal inference regarding associations of 
breastfeeding with child BP, BMI and IQ. Analyses were extended 
to include results from a meta-analysis of five LMICs (N^ 10 000) 
and compared with a randomized trial of breastfeeding promotion. 

Findings Although higher socio-economic position was strongly associated 
with breastfeeding in ALSPAC, there was little such patterning in 
Pelotas. In ALSPAC, breastfeeding was associated with lower BP, 
lower BMI and higher IQ, adjusted for confounders, but in the 
directions expected if due to socioeconomic patterning. In contrast, 
in Pelotas, breastfeeding was not strongly associated with BP or 
BMI but was associated with higher IQ. Differences in associations 
observed between ALSPAC and the LMIC meta-analysis were in line 
with those observed between ALSPAC and Pelotas, but with robust 
evidence of heterogeneity detected between ALSPAC and the LMIC 
meta-analysis associations. Trial data supported the conclusions 
inferred by the cross-cohort comparisons, which provided evidence 
for causal effects on IQ but not for BP or BMI. 

Conclusion While reported associations of breastfeeding with child BP and BMI 
are likely to reflect residual confounding, breastfeeding may have 
causal effects on IQ. Comparing associations between populations 
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with differing confounding structures can be used to improve 
causal inference in observational studies. 

Keywords ALSPAC, breastfeeding, causation, cognition, cohort, Pelotas 



Introduction 

Determining causal effects on health and disease from 
observational epidemiology studies is often limited by 
problems with confounding. Even large studies with 
detailed confounder adjustment may still be affected 
by residual confounding.' Although randomized con- 
trolled trials (RCTs) are the 'gold standard' for causal 
inference in public health research, for many modifi- 
able risk factors with potential causal effects, RCTs 
are not feasible or ethical. Furthermore, in disease 
prevention trials, there may be such a long lead-time 
between intervention and outcome that the very large 
and long-term follow-up studies that would be 
needed can only occasionally be carried out. Because 
of the ethical issues of 'experimenting' on participants 
and the costs of RCTs, one would also always want 
to conduct trials on interventions with the strongest 
prior evidence of having a likely causal effect. 
Therefore, leveraging the most robust causal inference 
possible from observational epidemiology is 
important. 

Various methods for improving causal inference have 
been discussed previously.^'^ These include maternal- 
paternal comparisons as tests of intrauterine effects,'^ 
comparing outcomes betweens siblings discordant 
for an exposure (e.g. mothers diabetic for one preg- 
nancy but not the other or where mothers breastfed 
one sibling but not the other);"^'' and a range of genetic- 
ally informative approaches (including twin studies,''"* 
that can additionally explore confounding by genetic 
factors, assisted conception studies^ and Mendelian 
randomization, which uses genetic information as 
instrumental variables for modifiable exposures'"). 
The application of several different approaches to 
any particular research question, building upon stand- 
ard conventional methods potentially improves the 
reliability of causal inferences. 

This article explores an additional method of explor- 
ing residual confounding and improving causal infer- 
ence, by comparing observational associations in a 
cohort from a high-income country (HIC) with 
those from low- or middle-income countries (LMIC), 
where confounding structures are known to differ. ' 
This approach is based on the idea that if an observed 
association is causal, the association should be pre- 
sent in both cohorts, regardless of differing confound- 
ing structures. Associations that are not replicated 
in cohorts with differing confounding structures are 
likely to reflect residual confounding in the cohort 
reporting an association, particularly if the direction 
of the association is consistent with plausible sources 
of residual confounding. This has been suggested 



previously in several studies of non-Western popula- 
tions where associations of breastfeeding and lower 
obesity, previously observed in high-income and 
Western cohorts, were not replicated. ^^^'^ 

We develop this concept further by, first, directly com- 
paring two cohorts [one from an HIC, the British Avon 
Longitudinal Study of Parents and Children (ALSPAC) 
and the other from a middle-income country, the 
Brazilian 1993 Pelotas cohort] with individual-level 
data, using a standardized approach for analyses be- 
tween cohorts. Using this approach, we compare the 
confounding structure of breastfeeding between the 
two settings and explore associations of breastfeeding 
in relation to three child outcomes that have been re- 
ported in the literature: body mass index (BMI), blood 
pressure (BP) and cognitive function (IQ; Intelligence 
Quotient). Secondly, we extend this comparison to a 
recently published meta-analysis of breastfeeding in 
five LMIC cohorts. Finally, inferences made from 
these cross-cohort comparisons will be compared with 
the results from a randomized trial of breastfeeding pro- 
motion'*''''' since, as highlighted, RCTs are the 'gold 
standard' for assessing causality in health research. 

Methods 

ALSPAC 

Participants 

ALSPAC is a geographically based prospective cohort 
study investigating the health and development of 
children.'*^ Pregnant women residing in three health 
districts in the south west of England with an ex- 
pected date of delivery between 1 April 1991 and 31 
December 1992 were eligible to enrol. A total of 
14 541 pregnant women were recruited and 13 678 
had a live-born, singleton child. For this study, we 
excluded parents and children of multiple births. 
Ethical approval of the ALSPAC study was obtained 
from the ALSPAC Law and Ethics Committee and 
three Local Research Ethics Committees. From 
annual clinics that the whole cohort were invited to 
attend, systolic blood pressure (SBP) and diastolic 
blood pressure (DBP) were assessed at 7 years, BMI 
at 9 years and IQ at 4 years. Indicators of socio- 
economic position included in this study were family 
income, maternal education, paternal education and 
family occupational social class. A full description of 
all variables is provided as Supplementary Data at IJE 
online. Data on breastfeeding in infancy are available 
for 10 665 children. Data on BP, BMI and IQ are avail- 
able for 7430, 6868 and 6851 children, respectively. 
The main analyses exploring breastfeeding and child 
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outcomes were carried out in 5193 (SBP/DBP), 4852 
(BMI) and 4891 (IQ) children with data on breast- 
feeding, outcome and potential confounding factors. 

Pelotas 

Participants 

Pelotas is situated in the extreme south of Brazil, near 
the Uruguayan border, with a population of approxi- 
mately 320 000 urban inhabitants, where >99% of all 
births occur in a hospital. In 1993, the five maternity 
hospitals in the city were visited daily and 5249 live 
newborns were enrolled; only 16 mothers could not 
be interviewed. This cohort is described in detail else- 
where.^^ At 1 and 3 months, a systematic sample of 
13% of the subjects was assessed. At 6 months, 1 year 
and 4 years, all low birth-weight infants plus a sys- 
tematic sample of 20% of the remaining — including 
those visited at 1 and 3 months — were targeted for 
follow-up. Breastfeeding data were collected at these 
follow-ups and are available for 1419 children. At 11 
years, the whole cohort was followed up at a home 
visit, where BMI (iV = 4442) and BP (iV = 4434) meas- 
urements were taken. Data on IQ are available in a 
subsample of children at 4 years (« = 614) who were 
invited to take part in detailed psychological assess- 
ments.^" Indicators of socio-economic position 
included in this study were the same as those 
included in ALSPAC, that is, family income, maternal 
education, paternal education and family occupational 
social class. A full description of all variables is pro- 
vided as Supplementary Data at IJE online. The main 
analyses were carried out on 1083 children for SBP 
and DBP, 1085 children for BMI and 506 children for 
IQ, with complete data on breastfeeding, outcome and 
potential confounding factors. Only singleton children 
were included in analyses. 

Statistical analysis 

Breastfeeding was analysed in grouped months and 
with never breastfed and breastfed for <1 month 
combined into a single category. This is because the 
number of infants never breastfed in Pelotas was ex- 
tremely low (3%) and because there is substantial 
misclassification between the two categories.^' We 
aimed to compare associations of socio-economic pos- 
ition and breastfeeding/child outcomes between 
ALSPAC and Pelotas. Since we were not able to trans- 
form the indicators of socio-economic position in each 
cohorts into variables with categories of equal sizes 
(such that they could be compared easily between 
the cohorts), we used indices of inequality [slope 
index of inequality (SII) for continuous out- 
comes; relative index of inequality (RII) for binary 
outcomes]. These indices (SII/RII) relate health out- 
comes to a measure of socio-economic position that 
takes into account the different proportions in each 
category. Each indicator of socio-economic position is 
converted into a variable represented by scores from 0 
to 1, with each category corresponding to a score 



calculated as the mid-point for the proportion of par- 
ticipants in that category based on the cumulative 
distribution. Where 0 is the lowest socio-economic 
level and 1 is the highest level, the scores reflect the 
relative social position of individuals in that category; 
that is, the proportion of individuals with a lower 
socio-economic position. For example, if 10% of the 
participants from one cohort fall in the lowest cat- 
egory for income, participants in this group would 
be allocated a score of 0.05 (0.1/2). If 25% of the 
participants fall in the second category, participants 
in this category would be allocated a score of 0.225 
(0.1+0.25/2), and so on. The SII/RII is then obtained 
by regressing each outcome measure on these 0 to I 
scores. For continuous outcomes, linear regression 
was used giving the mean difference in the outcome 
measure in the highest level (1) compared with the 
lowest level (0) of the socio-economic variable. For 
binary outcomes, logistic regression was used giving 
the odds of the outcome in the highest 
socio-economic level (1) compared with the lowest 
(0). For the main analysis, associations of breastfeed- 
ing duration with child BP, BMI and IQ were 
explored using multiple linear regression, both un- 
adjusted and then adjusted for child sex and indica- 
tors of socio-economic position (family income, 
maternal education, paternal education, occupational 
social class). All analyses involving BP were also ad- 
justed for chUd height; however, as height may be a 
potential mediator of an association between breast- 
feeding and child BP, main associations were also re- 
peated without this adjustment. To explore potential 
effects of missing data from loss to follow-up, associ- 
ations of breastfeeding with indicators of socio- 
economic positions were analysed comparing both 
the restricted sample (with complete data on breast- 
feeding, all confounders and at least one outcome) 
with the maximal sample available. All Pelotas ana- 
lyses were weighted to account for the over-sampling 
of low birth-weight infants. To compare associations 
in ALSPAC and Pelotas, we generated a summary as- 
sociation (per breastfeeding category difference, re- 
flecting the linear trend from one category to the 
next) and then used a Q-test (obtained from the 
Stata metan command) to test for heterogeneity be- 
tween these associations. These tests for heterogeneity 
were also carried out for comparisons between 
ALSPAC and the meta-analyses from five LMIC co- 
horts. Analyses were carried out using Stata 10 
(Stata Corp, College Station, TX). 



Results 

Prevalence for durations of breastfeeding are dis- 
played in Table 1, showing greater prevalence for 
each category of breastfeeding duration over 1 
month in Pelotas compared with ALSPAC. In add- 
ition, there was a higher prevalence of any breastfeed- 
ing (96.8 vs 84.3%) and exclusive breastfeeding at 2 
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Table 1 Distribution of infants according to duration of 
any breastfeeding 



Breastfeeding 
duration 
(months) 



Prevalence (%) 



Pelotas 



ALSPAC 



0 to <1 


15.6 


36.8 


I to <3 


25.4 


15.6 


3 to <6 


23.6 


13.7 


^6 


35.3 


33.9 


For analyses, tlie categories 
<1 montli were merged as 


of never breastfed and 
the prevalence of never 


breastfed 
breastfed 



in Pelotas was extremely low and there is substantial misclassi- 
fication between these categories.^' 
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Figure 1 (a) Prevalence of breastfeeding (exclusive or 
non-exclusive) at 3 months by family income group and 
(b) prevalence of ever breastfed by family income 



months (65.7 vs 38.3%) in Pelotas compared with 
ALSPAC. 

Figure la and b shows the prevalence of breastfeed- 
ing at 3 months and of ever breastfed according to 
family income (a key indicator of socio-economic 
position). There was no association between family 
income and breastfeeding in Pelotas and a strong 
positive association in ALSPAC, with higher rates of 
breastfeeding observed in the higher income groups. 

To compare between cohorts, the magnitude of the 
associations of socio-economic variables with breast- 
feeding at 3 months and child outcomes, RIls and 



SIIs are presented in Table 2. For each of the indica- 
tors of socio-economic position, there was strong in- 
equality in breastfeeding prevalences between the 
highest and lowest levels of socio-economic position 
in ALSPAC, which was not observed in Pelotas. Rlls 
for breastfeeding were between ~3.5 and ~8.5 times 
greater in ALSPAC compared with Pelotas. Slls for 
child outcomes were similar between ALSPAC and 
Pelotas. In both cohorts, more favourable socio- 
economic position was associated with lower SBP 
and DBP and higher IQ. However, more favourable 
socio-economic position was associated with lower 
BMl in ALSPAC but 'higher' BMl in Pelotas. Rlls 
for breastfeeding were re-assessed using the max- 
imum sample available (Supplementary Table SI 
available as supplementary data at IJE online). 
These were compared with those presented here that 
are based on restricted samples (i.e. only included 
individuals with complete data on breastfeeding, all 
confounders and at least one outcome). Associations 
in the maximal sample were similar to those observed 
in the restricted sample, but with even greater differ- 
ences between the ALSPAC and Pelotas samples (Rlls 
were between ~4.5 and ~10.5 times greater in 
ALSPAC compared with Pelotas). 

Associations of breastfeeding with child BP, BMl 
and IQ are presented in Table 3. In ALSPAC, longer 
duration of breastfeeding was associated with lower 
SBP and DBP in children, in unadjusted analyses. 
However, in Pelotas, child SBP and DBP were not 
strongly associated with duration of breastfeeding. 
Similar findings were observed when associations 
are adjusted for indicators of socio-economic position. 
For BMl in ALSPAC, longer duration of breastfeeding 
was associated with lower child BMl; however, in 
Pelotas there was weaker evidence for associations 
of breastfeeding with child BMl, with a trend towards 
breastfeeding being associated with higher BML 
These findings also remained the same after adjusting 
for indicators of socio-economic position. For IQ, in 
both ALSPAC and Pelotas, longer duration of breast- 
feeding was associated with higher child IQ, with 
similar magnitudes of association between the two 
cohorts. These associations persisted in both cohorts 
after adjustment for indicators of socio-economic pos- 
ition. All Pelotas analyses were weighted to account 
for the oversampling of low-birth- weight infants: 
however, to ensure that there was no effect modifica- 
tion by low birth weight, main associations were 
stratified by low birth weight. Associations were con- 
sistent in low birth weight and normal birth weight 
children (P-values for interaction between birth 
weight and breastfeeding duration for SBP = 0.6, 
DBP = 0.8 and 1Q = 0.96; see Supplementary 
Table S2 available as supplementary data at IJE 
online). For associations of breastfeeding with SBP 
and DBP, analyses were also repeated without adjust- 
ment for child height; results were not substantially 
altered (data not shown). 
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Figure 2a and b displays comparisons of breastfeed- 
ing associations between ALSPAC and Pelotas, and 
between ALSPAC and the large LMIC consortium 
(COHORTS) meta-analyses (iV> 10000).'^ The latter 
comparison supported the trends observed in the 
smaller ALSPAC-Pelotas comparison, i.e. that associ- 
ations of breastfeeding with SBP, DBP and BMI are 
not observed in LMIC cohorts. Although tests for het- 
erogeneity between ALSPAC-Pelotas breastfeeding as- 
sociations suggest differences were robust for BMI (P 
heterogeneity = 0.009) this was not the case for SBP 
(P heterogeneity = 0.6) and DBP (P heterogen- 
eity =0.5). However, with the larger sample from 
the COHORTS consortium, differences between 
ALSPAC and the LMIC meta-analyses were detected 
for associations with SBP (P heterogeneity <0.00I), 
DBP (P heterogeneity = 0.005) and BMI (P heterogen- 
eity <0.001). For IQ, associations with breastfeeding 
were consistent between ALSPAC and Pelotas (P- 
values for heterogeneity = 0.5 and 0.09 for unadjusted 
and adjusted associations, respectively). IQ was not 
reported in the COHORTS consortium meta-analysis. 

As associations may be different for prolonged 
breastfeeding, analyses were repeated exploring asso- 
ciations of breastfeeding for >I year with each out- 
come. Crude and adjusted associations are presented 
in Supplementary Table S2 (available as supplemen- 
tary data at IJE online). In ALSPAC, associations of 
breastfeeding for >I year were generally more strong- 
ly associated with child outcomes than breastfeeding 
^1 year (both versus no breastfeeding), but were in 
the same direction as that reported above for breast- 
feeding duration using categories up to 6 months. In 
Pelotas, stronger associations were observed for 
breastfeeding ^ 1 year compared with ^ 1 year with 
directions consistent with those observed in ALSPAC. 

In Table 4, the results of ALSPAC and Pelotas com- 
parisons for breastfeeding associations with BP, BMI 
and IQ, are compared with results from a breastfeed- 
ing promotion RCT, in which randomization to breast- 
feeding promotion resulted in much longer duration 
of any and exclusive breastfeeding compared with the 
control group.'*' The results from this RCT can be con- 
sidered the gold- standard test of the true causal ef- 
fects of breastfeeding on the outcomes that we have 
assessed here. Consistent with our cohort compari- 
sons, the trial found that breastfeeding increased 
childhood intelligence' '' but did not influence BP or 
BMI."^ 



Discussion 

In this study, we aimed to determine if greater dur- 
ation of breastfeeding has causal effects on child BP, 
BMI and IQ by making comparisons between HIC and 
LMIC cohorts. We first used individual-level data and 
standardized approaches to analyse differences be- 
tween the UK ALSPAC cohort and the Brazilian 
Pelotas 1993 cohort. The advantage of comparing 



associations of breastfeeding with child health be- 
tween ALSPAC and Pelotas is that marked differences 
in the socio-economic patterning of breastfeeding 
were found to exist between the two cohorts. Since 
a major unresolved issue among epidemiological stu- 
dies of breastfeeding associations with later outcomes 
is residual confounding by socio-economic pos- 
ition, comparisons between cohorts where the socio- 
economic patterning of breastfeeding is found to differ 
can potentially improve causal inference with respect 
to effects of breastfeeding on child health. 

In ALSPAC, as is generally the case in HICs,^^ 
breastfeeding was more common among women 
from higher socio-economic groups, whereas in 
Pelotas socio-economic position was not strongly 
associated with breastfeeding. 

In ALSPAC, breastfeeding was associated with all 
outcomes, specifically with lower SBP, DBP and 
BMI, and with higher IQ, even in models adjusting 
for socio-economic position. If these associations are 
not explained by residual confounding due to socio- 
economic position (or other characteristics that are 
socio-economically patterned), then we would expect 
similar magnitudes of association in the Pelotas 
cohort, where socio-economic position could not be 
a confounder since it is unrelated to the exposure. 
What we found in Pelotas was that breastfeeding 
was not strongly associated with SBP, DBP or BMI, 
but was associated with higher IQ, suggesting that 
associations with the former in ALSPAC, and other 
Western population cohorts, are likely to be explained 
by residual confounding, whereas the association with 
IQ is likely to be causal. However, despite observing 
differences between ALSPAC and Pelotas in terms of 
the effect sizes of breastfeeding associations with BP 
and BMI, formal tests for heterogeneity indicated that 
there was little evidence that the associations differed 
statistically between the cohorts. This is likely to be 
due to the smaller sample size of Pelotas and, indeed, 
when ALSPAC associations were compared with a 
larger sample from a LMIC consortium, there was 
evidence of systematic differences between ALSPAC 
and the LMIC cohorts with respect to the associations 
of breastfeeding with SBP, DBP and BMI. 

As a further validation of this cohort comparison 
method, we compared our results with those of an 
RCT"^ '^ and found that it also supported the sugges- 
tion that breastfeeding did not protect against greater 
BMI or BP but did increase IQ. These conclusions are 
further validated by several recent systematic reviews 
and meta-analyses of observational studies relating 
breastfeeding to BMI, BP and cognitive function, 
which also suggest that with BMI^'' and BP^^'^*' asso- 
ciations may be importantly influenced by residual 
confounding, whereas associations with cognitive 
function appear robust and less lil<;ely to be due to 
confounding.^'' 

Although we found evidence of causal effects of 
breastfeeding for child IQ, the crude association was 
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(a) 



(b) 



Child outcome 



Breastfeeding association (per category) 
Effect size (95% CI) 



SBP (mmHg) 
ALSPAC 
Pelotas 

P hetero = 0.6 

DBP (mmHg) 
ALSPAC 
Pelotas 

P hetero = 0.5 

BMI (kg/m^) 
ALSPAC 
Pelotas 

P hetero = 0.009 

IQ 

ALSPAC 
Pelotas 

P hetero = 0.09 



-0.35 (-0.55 to -0.14) 
-0.13 (-0.83 to 0.57) 



-0.16 (-0.31 to -0.01) 
0.05 (-0.50 to 0.60) 



-0.16 (-0.22 to -0.09) 
0.14 (-0.07 to 0.36) 



0.97 (0.62 to 1 .32) 
1 .97 (0.88 to 3.05) 



1.5 



2.5 



3.5 



Child outcome 






Breastfeeding association (per category) 
Effect size (95% CI) 


SBP (mmHg) 
ALSPAC 

COHORTS Consortium 

P hetero <0.001 






-0.35 (-0.55 to -0.14) 
0.12 (-0.01 to 0.24) 


DBP (mmHg) 
ALSPAC 

COHORTS Consortium 

P hetero = 0.005 






-0.16 (-0.31 to -0.01) 
0.10 (-0.01 to 0.20) 


BMI (kg/m^) 
ALSPAC 

COHORTS Consortium 

P hetero <0.001 






-0.16 (-0.22 to -0.09) 
0.04 (0.00 to 0.08) 



1.5 



2.5 



3.5 



Figure 2 (a) Comparison of breastfeeding associations with child outcomes between ALSPAC and Pelotas and (b) comparison 
of breastfeeding associations with child outcomes between ALSPAC and published meta-analyses'^ from the COHORTS 
Consortium based on five LMIC cohorts. Effect sizes reflect the difference in child outcomes per increasing category of grouped 
breastfeeding duration. (ALSPAC/Pelotas: 0 to <1 m, I to <3m, 3 to <6m, 6 months or more; COHORTS: None, sjlm, sS3m, 
<6m, <9ni, <12m, <18m, ^ 24 m, >24 months ) . CI, confidence interval 
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Table 4 Summary of results from the cross-cohort comparison and validation using a randomized trial 



Comparison method 


Validation 


Association with any breastfeeding (per category)^ 


Effect of breastfeeding intervention 


ALSPAC Pelotas 


Belarus 


Strong socio-economic Weak socio-economic 
patterning in breastfeeding patterning in breastfeeding 


Randomized trial 



Outcome 


P 


95% 


CI 


P 


P 


95% CI 


P 


Difference 
in outcome'' 


95% CI 


SBP (mmHg) 


-0.35 


-0.55 to 


-0.14 


0.001 


-0.13 


-0.83 to 0.57 


0.7 


0.2 


-2.9 to 3.3 


DBP (mmHg) 


-0.16 


-0.31 to 


-O.OI 


0.04 


0.05 


-0.50 to 0.60 


0.9 


0.2 


-1.8 to 2.2 


BMI (kgW) 


-0.16 


-0.22 to 


-0.09 


<0.00I 


0.14 


-0.07 to 0.36 


0.2 


O.I 


-0.2 to 0.3 


IQ 


0.97 


0.62 to 


1.32 


<0.00I 


1.97 


0.88 to 3.05 


<0.00I 


5.9 


-1.0 to 12.8 



^None/<l month; 1 to <3 months; 3 to <6 months; ^6 months; fully adjusted models. 
''Intervention vs control. Results extracted from publications from the Belarus PROBIT trial."''''' 



CI, confidence interval. 



found to attenuate substantially following adjustment 
for confounders. It may be argued that a large attenu- 
ation of effects following confounder adjustment sug- 
gests a high likelihood of any remaining associations 
being due to residual confounding. Some of the ad- 
justment factors, such as maternal and paternal edu- 
cation, are strongly related to parental IQ, which has 
a direct link to offspring IQ. Thus, it is possible that 
residual confounding may be a persisting factor in the 
association of breastfeeding with IQ. Indeed, it has 
been found that maternal IQ may be the primary 
driver of the associations between breastfeeding and 
child IQ in HICs.^ However, in a previous study in the 
Philippines, breastfeeding was also found to be posi- 
tively associated with child cognition,^*^ despite an 
inverse association between breastfeeding and socio- 
economic position and the RCT evidence also suggests 
a causal effect,^'' which is consistent with our cohort 
comparisons. 

Strengths and limitations 

The main strength of this article is in exploring causal 
inference when confounding patterns show marked 
differences between the cohorts being compared. 
A further strength is that in both cohorts we were 
able to adjust for a range of socio-economic markers 
(income, occupational social class and parental edu- 
cation), unlike other cohorts where fewer indicators 
are available. Consistent with prospective cohort stu- 
dies, and in particular birth cohorts, there was attri- 
tion over the follow-up period. However, when we 
examined the associations of socio-economic position 
with breastfeeding in the both the maximal number 
in each cohort (i.e. not restricting analyses to only 
those with follow-up outcome data and all 
co-variables) and the samples restricted to complete 
data, the direction and magnitude of these 



associations were similar in both cohorts. This sug- 
gests that those who were followed up in relation to 
child outcomes and analysed here are not a select 
group with respect to the association of 
socio-economic position and breastfeeding. In add- 
ition, as mentioned above, the comparisons between 
ALSPAC and Pelotas may have been underpowered to 
detect some cross-cohort differences in breastfeeding 
associations due to the smaller sample size in the 
Pelotas cohort (iV~ 1000 in Pelotas vs ~5000 in 
ALSPAC). Thus, we included an additional compari- 
son of ALSPAC with a large published meta-analysis 
of breastfeeding associations in five LMICs 
(N>10000)'^ and this provided robust evidence for 
systematic differences between the HIC and LMIC co- 
horts with respect to breastfeeding associations with 
SBP, DBP and BMI, which were consistent with the 
trends observed in the ALSPAC-Pelotas comparison. 
Finally, the interpretation of our findings is based 
on the underlying assumption of this method that 
the differences between the two cohorts are explained 
by the different confounding structures (in particular 
the relationship of socio-economic position to breast- 
feeding) in the two countries. Another possible 
explanation for the discrepant findings between 
them is that the effect of breastfeeding is being mod- 
ified by an unmeasured factor that is only frequent 
in one of the cohorts. For example, breast milk sub- 
stitutes used in the two cohorts might differ. 
However, for this to be a plausible explanation for 
the observed results, the modifying factor (i.e. breast 
milk substitute) would have to be one that only 
has an impact on the association of breastfeeding 
with BMI and BP but not with its association with 
cognitive function. Furthermore, the consistency of 
our cohort comparison results with the findings 
from a large RCT of breastfeeding promotion does 
not support this. 
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Conclusions 

We have demonstrated how comparing associations 
between populations with differing confounding 
structures can be used to explore the likelihood of 
residual confounding and improve causal inference. 
Based on this method, we find that previously re- 
ported associations of breastfeeding with child BMI 
and BP are likely to reflect residual confounding; 
however, we find evidence of a causal relationship 
between breastfeeding and child IQ. This method 
could be extended to other epidemiological associ- 
ations where the confounding structure of the expos- 
ure or outcome varies between different populations 
and where large enough sample sizes are available to 
facilitate reliable comparisons. Different confounding 
structures may be considered to exist where associ- 
ations with exposure and outcome are present in 
one population, and hence could confound the asso- 
ciation, and absent in another for either exposure or 
outcome (or both) as presented here or where the 
association of the potential confounder with either 
exposure or outcome exists in opposite directions be- 
tween populations'^'^® or where the strength of the 
associations with exposure or outcome differs be- 
tween populations.^ Further studies with large 
sample sizes characterizing such exposures that 
differ in their confounding structure among popula- 
tions will be required. 
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KEY MESSAGES 

• Residual confounding remains an unresolved problem in observational epidemiology studies. 

• Comparing associations between populations with different confounding structures can be used to 
improve causal inference. 

• This approach is applied here to cross-cohort comparisons of breastfeeding associations in HICs and 
LMICs where the socio-economic patterning of breastfeeding is known to differ. 

• There was evidence to suggest that although associations of breastfeeding with lower child BP and 
BMI are likely to be due to residual confounding, associations with greater child IQ may be causal. 
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